home *** CD-ROM | disk | FTP | other *** search
-
- Morning Tutorials: 8:30 a.m. - 12:30 p.m.
-
- Introduction to Information Retrieval
-
- Keith van Rijsbergen, University of Glasgow
-
- This tutorial will enable participants to reach an
- understanding of the science and engineering underlying
- information retrieval research and development. It is aimed at
- anyone who wants to:
- (a) do research in IR but has little or no basic knowledge in
- the subject;
- (b) examine the state of research and development in IR for
- commercial purposes;
- (c) teach IR at advanced undergraduate or postgraduate level but
- has no prior knowledge;
- (d) think about IR but has rarely thought about it before and
- does not know where to start.
- The tutorial will answer the following questions:
- (a) What is IR?
- (b) What are sensible models for IR?
- (c) How to measure things in IR.
- (d) What impact has IR research had on the existing technology?
-
- The tutorial will include a demonstration of a current system.
-
- The reward for participants a feel for the excitement of
- state-of-the-art in IR!
-
- Keith van Rijsbergen was born in Holland in 1943. He was
- educated in Holland, Indonesia, Namibia and Australia. He took a
- degree in mathematics at the University of Western Australia. As
- a graduate he spent two years tutoring in mathematics while
- studying computer science. In 1972 he completed a Ph.D. in
- computer science at Cambridge University. After almost three
- years of lecturing in information retrieval and artificial
- intelligence at Monash University he returned to the Cambridge
- Computer Laboratory to hold a Royal Society Information Research
- Fellowship. In 1980 he was appointed to the chair of computer
- science at University College Dublin; from there he moved in 1986
- to the Glasgow University where he is now, and indeed is head of
- department.
-
- Since about 1969 his research has been devoted to
- information retrieval, covering both theoretical and experimental
- aspects. He has specified several theoretical models for IR and
- seen some of them from the specification and prototype stage
- through to production. His current research is concerned with the
- design of appropriate logics to model the flow of information. He
- is also involved in several Esprit projects concentrating on the
- engineering issues associated with the building of IR systems. He
- is a fellow of the IEE and a member of the BCS. In 1993 he was
- appointed Editor-in-Chief of The Computer Journal. He has also
- recently been appointed a member of the Advisory Committee for
- GMD in Germany.
-
-
- Developing Information Retrieval Applications using Object
-
- Database Technology
-
- David J. Harper, University of Glasgow
-
- The field of information retrieval (IR) has yielded a range
- of techniques for efficiently and effectively storing and
- retrieving information items based on their text content. The
- utility of many application systems could be greatly enhanced by
- employing these techniques. Examples of such applications include
- office information systems, hypertext systems, and advanced
- information systems for government and industry. In this
- tutorial we describe how modern IR techniques, namely automatic
- indexing and best-match retrieval, can be efficiently implemented
- and conveniently delivered to application developers
-
- using object-oriented design and object database technology. We
- examine some sample applications and thereby demonstrate that the
- resultant IR framework, which takes the form of a class library,
- is both useable and extensible by application developers. We
- argue that our approach has a number of advantages over
- traditional mechanisms of delivery such as standalone packages
- and conventional programming language libraries. The tutorial
- is aimed at software designers and engineers who want to employ
- modern IR techniques within application systems, academics and
- researchers who are interested in implementing, experimenting
- with or prototyping IR systems, and managers wanting to assess
- the benefits of implementing IR applications using object
- database technology. Basic knowledge of IR and database concepts
- is assumed. A brief introduction to object-oriented concepts and
- object database technology will be provided.
-
- David J. Harper is a lecturer in the Department of Computing
- Science, University of Glasgow, Scotland. He is a principal
- investigator in Glasgow on the EEC-funded ESPRIT project
- Comandos, which is concerned with the construction and management
- of distributed open systems, and he leads a major research stream
- on object data management services. His other research interests
- include information retrieval systems, object-oriented design and
- programming, formal specification of database systems, and office
- information systems.
-
- He received his B.Sc.(Hons) in Computer Science from Monash
- University, Australia, in 1973 and his Ph.D. in Computing Science
- from the University of Cambridge, England, in 1980.
-
- Trends in Multimedia Development
-
- Alan Griffiths, University of Sheffield
-
-
- New products and applications for multimedia are being
- announced almost daily and there is a considerable hype over what
- is still a commercially insecure field with ill-defined standards
- and user groups. This tutorial will provide a stimulating
- overview of multimedia and will discuss a wide range of examples
- including computer-assisted-learning (CAL), image banks,
- electronic books and virtual reality. The currently available
- multimedia products and services will be examined to look for
- trends in development and these will be extrapolated into the
- future to propose products and services which will take advantage
- of the hardware, software and intellect combinations to create
- imaginative learning and information environments. This
- enthusiastic view will be tempered by discussing the very real,
- and unresolved, problems in designing large multimedia systems.
- The aims of the
- tutorial are:
- (a) To provide an overview of current and future trends in
- multimedia;
- (b) To provide thought-provoking possibilities for information
- products incorporating multimedia;
- (c) To stimulate discussion on how information retrieval
- techniques can and are being implemented into multimedia
- products.
-
- The attendees for this tutorial will be people interested in
- the stimulating prospects that multimedia offers, but not
- necessarily specialists in the field. Multimedia will not provide
- a solution to information retrieval but it can provide front-end
- systems which use the algorithms and techniques outlined
- elsewhere in the conference.
-
- Alan Griffiths is an independent consultant working in the
- areas of database and multimedia design, and is also a part-time
- lecturer in multimedia at the Department of Information Studies,
- University of Sheffield, England. Over the past ten years he has
- completed consultancies for a diverse group of clients ranging
- from heavy industry to public sector organizations. These clients
- have included Birmingham City Council, the Council for British
- Archaeology, Hewlett Packard, various parts of the British
- government, Basil Blackwells, Bridon Ropes and the Health &
- Safety Executive. His hazardous chemical software, produced for
- the Health & Safety Executive, is marketed by HMSO and used
- world-wide. His presentations on multimedia have been given to
- academic groups, software houses, companies and the public in
- England, France, Italy, Canada and Mexico.
-
-
- Afternoon Tutorials: 1:30 - 5:30 p.m.
-
- Natural Language Processing for Information Retrieval
-
- David Lewis and Elizabeth Liddy, AT&T Bell Laboratories and
- Syracuse University
-
- Our subject is the application of natural language
- processing (NLP) methods to information retrieval (IR) systems.
-
- We will discuss the characteristics of human language that make
- IR difficult, and will provide an extensive treatment of both NLP
- methods and their non-NLP alternatives for addressing these
- characteristics. We will also discuss the newer area of using IR
- components to aid NLP systems. This tutorial should be of
- interest both to researchers interested in past work and current
- research directions, and to application developers and managers
- who need to choose practical strategies for immediate
- implementation. Linguistic examples will be drawn primarily from
- English and Japanese, with a scattering from other languages. A
- bibliography on NLP and IR will be distributed.
-
- David D. Lewis (lewis@research.att.com) is a Member of
- Technical Staff at AT&T Bell Laboratories in Murray Hill, NJ,
- where his research areas are IR, NLP, machine learning, and their
- intersections. Lewis has implemented a variety of experimental
- and operational IR and NLP systems, as well as being an organizer
- for several large scale evaluations of such software.
-
- Elizabeth D. Liddy (liddy@suvm.syr.edu) is an assistant
- professor in the School of Information Studies at Syracuse
- University, where her research areas are discourse linguistics,
- semantic disambiguation, and the use of natural language
- processing in information systems. She is a co-principal
- investigator on the DR-LINK project funded under the DARPA
- Tipster initiative on text retrieval and filtering.
-
-
- Information Retrieval and Databases
-
- Norbert Fuhr, University of Dortmund, Germany
-
- Besides basic database functions such as concurrency,
- recovery, security and integrity, future IR systems will also
- need data modelling concepts for coping with structured documents
- and powerful query languages in order to ask for any kind of
- object in the database (for example authors, affiliations,
- journals). On the other hand, database systems are required to
- handle textual attributes and to provide appropriate text
- retrieval facilities. This tutorial will present specific
- database concepts which are important for both types
- of systems.
-
- After a short introduction to basic database concepts, three
- major lines of research will be considered: modelling data
- structure (relational and NF2 models); modelling structure and
-
- behavior (object-oriented databases); and modelling uncertainty
- (Fuzzy, Bayesian and Dempster-Shafer approaches). For each of
- these approaches, examples for their application to IR problems
- will be presented.
-
- The target audience is those who have basic knowledge in IR
- and now want to learn about the database aspects of the field.
- The course would be most useful for people designing or building
- IR or database systems.
-
- Norbert Fuhr is professor in the computer science department
- of the University of Dortmund, Germany. He holds a diploma and
- doctorate in Computer Science from the Technical University of
- Darmstadt. He is well-known for his theoretical and experimental
- work on probabilistic IR models. Recently, his research
- interests have focused
- on the application of IR methods to factual databases and the
- integration of IR and database systems.
-
-
- Non-Textual Compression in Full-Text IR Systems
-
-
- Shmuel T. Klein, Bar-Ilan University, Israel
-
- Data compression has received increased attention lately,
- resulting in a large number of recent publications in this area.
- The tutorial presents an overview of these recent developments.
- We shall first classify the various types of files that appear in
- a full-text
- retrieval system. The focus will be on the auxiliary structured
- files, such as dictionaries, concordances and various forms of
- bitmaps, (which may significantly enhance the retrieval
- algorithms) and will present state of the art approaches for
- compressing them. The tutorial
- is intended for researchers, designers and users of large textual
- IR systems. By the end of the tutorial, the participant should be
- able, for a given file, to choose the appropriate compression
- technique and design his/her own encoding and decoding routines.
-
- Shmuel T. Klein received his Ph.D. from the Weizmann
- Institute of Science in Israel. He then spent three years at the
- University of Chicago and is now at Bar-Ilan University in
- Israel. Dr. Klein has worked on two of the world's largest
- natural language full-text information retrieval systems: the
- Responsa Project at Bar-Ilan University (in Hebrew),
- and the Tresor de la Langue Francaise at the University of
- Chicago (in French), and has published several papers on various
- aspects of compression in IR systems.
- - -- - - - - - - - ---more on IANET '93- - - ---- -
- INFORMATION ACCESS AND THE NETWORKS
- A Research Workshop (held after SIGIR'93)
- July 1, 1993 --- Pittsburgh, PA
-
- Information access involving networks is one of the most important
- growth areas in the broad field of information technology. User
- demand has caused a massive growth in the size of the Internet and
- the
- amount of information accessible on it. This has made it imperative
- to develop new technologies to provide an infrastructure for this new
- information space. This workshop will bring together researchers and
- developers to discuss network-based information services in both
- general and specific terms, drawing upon those attending ACM SIGIR
- '93
- and the communities related to:
- * Archie
- * Digital Libraries
- * Gopher
- * HyperPage
- * Knowbots
- * NCSA Mosaic
- * NNTP/Usenet
- * Prospero
- * Veronica
- * WAIS
- * WWW (WorldWideWeb)
- * Z39.50
- along with other related initiatives, protocols, systems, and
- services.
-
- Attendees will include persons with an active interest in
- network-based information systems and relevant research.
-
- Presentations will include design and implementation of
- client-server architectures, resource discovery, distributed file
- systems and services, development of integrated network tools, and
- innovative
-
- applications of more basic technologies, such as bulletin board
- systems and
-
- electronic mail.
-
- The workshop format involves a plenary session from 8:30-11:30 a.m.
- with invited speakers, each of them addressing a major theme in the
- area of wide-area networking. After some discussion, the workshop
- will
- break up into groups, starting with a working lunch. The
- organization
- of these groups and assignment of participants to them will be based
- on a short interest statement provided by each participant.
-
- The workshop will conclude with brief summary presentations by
- group leaders. Notes from the plenary talks and interest
- statements will be provided to attendees and made available
- for online access after the workshop.
-
- In addition to discussion of the initiatives, protocols, systems, and
- services listed above, other topics of interest include:
- * scaling - to handle more users, more databases or
- documents,
- bigger databases, larger data items (e.g.,
- multimedia)
- * efficiency - algorithms, data structures,
- simulations, experiments
- * effectiveness - evaluation methods, studies, designs
- * interfaces - platforms, user needs, development methods,
- evaluations, integration with other applications
- * limitations and future plans for enhancement
- * progress in developing standards or using existing ones
- * technical details relating to protocols, implementations
- * user studies, application surveys, innovative uses
- * integration - plans, designs, requirements,
- implementations
- of systems to integrate information access over
- the networks
- * novel applications using network-enabled information
- access
- technology.
-
- If you are interested in attending, please prepare a 1-page statement
- of interest, covering your background, experience, and topics of
- interest. Priority, in case of limited space, will be given to SIGIR
- attendees and those who have submitted interest statements.
-
- The Program Committee includes:
- George Brett, CNIDR
- Edward Fox, Chair (of workshop, SIGIR), Virginia Tech
- Jose-Marie Griffiths, U. Tenn.
- Brewster Kahle, WAIS Inc.
- Clifford Lynch, U. California
- Craig Stanfill, Thinking Machines
- Craig Summerhill, Coalition for Networked Information
- Chris Tomer, Co-chair and local arrangements coordinator
-
- Interest statements should be submitted by May 15. They may
- accompany
-
- registration forms or be sent directly to:
- Chris Tomer
- School of Library and Information Science
- University of Pittsburgh
- Pittsburgh, PA 15260, USA
- tel: (412) 624-9448; fax: (412) 648-7001
- email: ianet@lis.pitt.edu
-
- Further information is available from:
- Ed Fox, (fox@fox.cs.vt.edu; foxea@vtcc1.bitnet) or
-
- Chris Tomer (ianet@lis.pitt.edu)
-
-
- Cost for this one-day session is $50,
- which includes coffee breaks and lunch.
-
- ------- End of Forwarded Message
-
-
-
-